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Abstract 

Transgenic crops are very much in the news due to the increasing public debate on their acceptance. In the scientific 
community though, transgenic plants are proving to be powerful tools to study various aspects of plant sciences. 
The emerging scientific revolution sparked by genomics based technologies is producing enormous amounts of 
DNA sequence information that, together with plant transformation methodology, is opening up new experimental 
opportunities for functional genomics analysis. An overview is provided here on the use of transgenic technology 
for the functional analysis of plant genes in model plants and a link made to their utilization in transgenic crops. 
In transgenic plants, insertional mutagenesis using heterologous maize transposons or Agrobacterium mediated 
T-DNA insertions, have been valuable tools for the identification and isolation of genes that display a mutant 
phenotype. To discover functions of genes that do not display phenotypes when mutated, insertion sequences have 
been engineered to monitor or change the expression pattern of adjacent genes. These gene detector insertions 
can detect adjacent promoters, enhancers or gene exons and precisely reflect the expression pattern of the tagged 
gene. Activation tag insertions can mis-express the adjacent gene and confer dominant phenotypes that help bridge 
the phenotype gap. Employment of various forms of gene silencing technology broadens the scope of recovering 
knockout phenotypes for genes with redundant function. All these transgenic strategies describing gene-phenotype 
relationships can be addressed by high throughput reverse genetics methods that will help provide functions to the 
genes discovered by genome sequencing. The gene functions discovered by insertional mutagenesis and silencing 
strategies along with expression pattern analysis will provide an integrated functional genomics perspective arid 
offer unique applications in transgenic crops. 



Genomics 

Genome sequencing of a number of organisms is 
providing the scientific community with a vast re- 
source of DNA sequence information that is revo- 
lutionizing the way science is being done. Due to 
technological breakthroughs DNA sequence inform- 
ation of small and medium sized genomes is being 
uncovered *en masse'. An organism that used to be 
studied at the individual gene level can now be ex- 
amined in terms of its genome organization, expres- 
sion and interaction - a field that has been given 
the new buzz word name 'genomics' . The science of 
genomics is the study of how genes and genetic in- 
formation are organized within the genome, and how 



this organization determines their function. This sci- 
ence was given an impetus by a bold plan to sequence 
the human genome that stimulated the development 
of efficient and cheap high throughput (HTP) sequen- 
cing techniques. A large number of microbial genomes 
are being sequenced, with six archaea and 17 bacteria 
already in public databases while the genomes of six 
more bacteria are completed. Among the eukaryota the 
complete genomes of yeast and Caenorhabiditis have 
been published (Mewes et al., 1997; The C. elegans 
sequencing consortium, 1998) while that of Drvso- 
phila is recently reported to be complete. Among 
higher eukaryotes six complete chromosomes have 
been published including one human and two Ara- 
bidopsis chromosomes. The complete genomes have 
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already entered the post-genomic era where gene func- 
tions are being systematically uncovered by functional 
genomics methodology. 

Model genomes 

Plant genomics has taken root with the establishment 
of model plants like Arabidopsis that set a large group 
of scientists to dissect this plant using the tools of 
genetics, biochemistry, physiology and molecular bio- 
logy. Arabidopsis is a weed that grows all over the 
world, has one of the smallest plant genomes ( 140 Mb, 
or million base pairs), is easy to grow, transform and 
use for genetic experiments. Structural genomics re- 
search starting from genetic and physical maps to HTP 
whole genome sequencing, is a publicly funded mul- 
tinational effort from research groups in Japan, Europe 
and the United States. The sequences of chromosome 
2 (19.6Mb) and chromosome 4 (1 7.38 Mb) have re- 
cently been published (Lin et al., 1999; Mayer et al., 
1 999) and the whole genome sequence will be avail- 
able by the end of this year. The Arabidopsis genome 
sequence provides us with a blueprint of the basic set 
of genes that is required for the growth and mainten- 
ance of a simple plant, but the systematic correlation 
of gene sequence and specific function in the plant 
still needs to be done. Some of the remarkable insights 
emerging from whole chromosome sequences are the 
number of gene clusters, large-scale duplications of 
chromosome segments, insertions of huge chunks of 
mitochondrial DNA, as well as 40% of newly dis- 
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temational Arabidopsis sequencing effort is concen- 
trating on sequencing the ecotype Columbia, genome 
sampling or random sequencing of another ecotype, 
Landsberg erecta, by groups including TIGR that has 
15,000 sequences on its website (www.tigr.org), will 
contribute many markers including SNPs (single nuc- 
leotide polymorphisms) for fine mapping of mutants. 
The similarity of genes and gene order between Ara- 
bidopsis and its closest relatives the crucifers (Gale & 
Devos, 1 998) will facilitate the direct use of Arabidop- 
sis genomic and functional information in the crop 
plants. 

While Arabidopsis is very much the model dicot 
plant, rice with its relatively small genome (450 Mb) 
has been selected to be the model monocot plant 
whose genome will be sequenced by a multinational 
effort. The other cereals like maize, barley and wheat 
have genomes of size 3000, 5000 and 16000Mb, re- 
spectively that will not soon be sequenced completely. 



The demonstration that genetic selection for gene rich 
sequence clones can be made for maize genome se- 
quencing (Rabinowicz et al., 1999), suggests that 
shotgun sequencing of the gene rich part of large gen- 
omes will be a feasible method for gene discovery. The 
gene order or synteny between genes in the monocots 
is well conserved (Gale & Devos, 1998) so that the 
compact rice genome sequence will help to decipher 
the gene order and content in the more complex gen- 
omes. As the monocot model is also a crop plant, 
with a half billion tons grown annually worldwide and 
90% used for consumption, deciphering this model 
becomes attractive for commercial applications. An 
additional genome model gaining recognition is the 
legume Medicago truncatula to dissect symbiotic ni- 
trogen fixation (Nam et al., 1999). Besides the model 
genomes, some plants may be considered models due 
to their characteristic advantages, for example, maize 
genetics, tobacco cell culture, tomato fruit biology. 
The accessibility of genomic biology approaches to 
these and genome models will help our understanding 
of plant biology from a problem oriented (bottom-up) 
approach as well as the model genome (top-down) 
approach. 

EST sequencing 

The expressed genes of a plant can be effectively dis- 
covered by sequencing expressed sequence tags (ESTs 
or complementary DNA from transcribed genes) that 
reveal the active genes in the genome in a specific 
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available in public databases from 19 plant species 
like Arabidopsis, rice, tomato, maize, soybean, cotton, 
loblolly pine etc, and offer an efficient way of gene 
discovery in these plants. The ESTs are redundant 
with highly expressed genes represented many times 
so sequence comparisons are required to quantify the 
number of genes. 

There are 37,746 Arabidopsis ESTs (H5fte et al., 
1993; Newman et al., 1994) submitted to dbEST, 
and after comparisons to characterized expressed and 
predicted transcripts, a set of about 16,000 unique 
transcripts has been identified in the Arabidopsis Gene 
Index (Quackenbush et al., 2000). Previous analysis 
(Rounsley et a)., 1996) suggested that ESTs exist for 
around 55-60% of the Arabidopsis genes, and this is 
confirmed with the present genomic sequencing res- 
ults. EST comparisons could also be used to define 
unique tags to individual members of multigene fam- 
ilies like cytoplasmic ribosomal proteins (Cooke et al., 
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ich 1997) where 50 different proteins are encoded by 106 

se - genes. 

hat For other plants dbEST contains 46,268 rice, 

29,209 maize, 26,255 tomato and 17,738 soybean 
Tie ESTs that have been classified into individual Gene In- 

ots dices (Quackenbush et al., 2000). Surprisingly though, 

the proprietary databases, for example from DuPont, Con- 

ner tain 127,247 wheat, 389,733 maize, 114,162 rice and 

sn- 183,663 soybean ESTs (Meyers et al„ 1999). These 

wt, efforts lend support to the idea that for gene discov- 

wd ery in the larger genomes we need not have to wait 

del for genome sequencing. To avoid the redundancy of 

An random EST sequencing, normalized cDNA libraries 

the have been used to create a 'unigene* set of clones 

n *- representative of the genome, 

del A comparison of EST databases from different 

iue plants and tissue/conditions reveals the diversity in 

ize coding sequences between plants, yet provides a 

gy* global perspective of the similarities in genes for spe- 

t° cific tissues (e.g. ripening) or conditions (e.g. patho- 

ing gen induced). The sequence similarity between the 

up) plant genes, analyzed by bioinformatics tools, permit 

vn ) assignment of probable gene function and identifica- 

tion of orthologs between species. Even though sim- 
ilarity screens to databases can assign some degree 
of general function to genes for transcription factors 
or signal transduction, their exact function requires 
lis- experimental approaches. 

*Ts 

hat p unctional genomics approaches 
ific 



The field of functional genomics has blossomed forth 
to specifically address the function of genes dis- 
covered by genome sequencing. In contrast to the 
previously prevalent gene by gene approaches, new 



ant HTP methods are being developed for expression ana- 

oes lysis as we j] as the recovery and identification of 

the mutants. The experimental approach is consequently 

changing from a hypothesis driven procedure to a non- 
biased data collection and archiving methodology that 
ST, makes these data available in relational databases that 

rod can be analyzed later by bioinformatics tools. The 

jue functional genomics methodology is also changing the 

;ne experimental strategy from a Forward Genetics, that 

'sis is, mutant to gene approach, to a Reverse Genetics 

f° r or sequenced-gene to mutant and function approach 

5 is (Bouchez & Hdfte, 1998). Though still at its infancy, 

functional genomics of model plants is expected to 
ine bear fruit towards the understanding of basic plant 

un- biology, as well as the exploitation of genomic inform- 

al* ation for crop plant improvement. This is because a 



large number of gene-functions for generic traits will 
be functional across species, either directly or after 
identifying the functional homologues. 

Expression analysis 

The major developments in whole genome analysis 
have been in the field of transcript expression ana- 
lysis using a variety of HTP methods. In plants 
different technologies have been employed based on 
high-density nylon filters (Desprez et al., 1998), mi- 
croarrays (Schena et al., 1995), various gel systems 
(Baldwin et al., 1999) and even a sequencing-based 
method SAGE (Matsumura et al., 1999). Some novel 
systems that offer much promise are a new gel based 
strategy called GeneCalling by CuraGen Corporation 
(Bruce et al., 2000) and a revolutionary microbead 
system from Lynx Therapeutics (Brenner et al, 2000). 
The most popular are expression microarrays on chips 
due to the availability and convenience of the tech- 
nology that is also applicable to non-model plants 
(Lemieux et al., 1998). With gene chips or microarrays 
all the genes of an organism (as ESTs or synthes- 
ized oligonucleotides) can be placed on a solid support 
like glass and used in hybridization experiments with 
different RNA samples to reveal the gene expression 
patterns. This is quite commonly used in yeast but 
progressively being employed in more complex organ- 
isms, ranging from Arabidopsis (Ruan et al., 1998) 
to maize and humans. The expression profile of a 
developmental stage or induced condition can help 
identify genes and coordinately regulated pathways 
and their functions deduced by association. However, 
the expression patterns of genes supply correlative 
information and do not necessarily prove a causal re- 
lationship between gene sequence and function. As 
factors other than mRNA level alone, for example, 
protein modification or metabolite flux, determine the 
activity of a gene product in situ, even genome wide 
expression analysis cannot prove a gene function rela- 
tionship and other supplementary experimental data is 
required. 

The real picture of gene expression is offered by 
proteomics (Dove, 1999) that addresses the protein 
expression of a cell type, the protein modifications 
and can also reveal actual interactions in the cell. 
Recent developments in mass spectrometry have sim- 
plified protein analysis and characterization and have 
become automated. The availability of whole gen- 
ome sequences in databases provides the basis for 
the identification of peptide tags from analyzed pro- 
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teins. The proteome or protein content of a cell type, 
is limited to normally expressed proteins while min- 
imally expressed products are beyond the limits of 
resolution, probably requiring separate enrichment or 
sensitization methods. As the proteome is also quite 
complicated to analyze for plant cells with their cell 
walls, the metabolome provides a picture of all meta- 
bolites that are present in the cell/tissue type revealing 
the active biochemical processes. Methods for plant 
metabolite profiling (Trethewey et aL, 1999) are still 
at infancy and due to the nature of the system are 
very prone to uncontrolled variation. Further tech- 
nological developments along with genome analysis 
information hold great promise for the analysis of the 
plant proteome and metabolome. Expression analysis 
tools combined with mutant or over-expression ana- 
lysis of genes will then be able to provide a unique 
multi-dimensional picture of genetic circuits and 
pathways. 

Gene-phenotype relationship 

The phenotype associated with a gene function is 
often the best clue to its role in the plant Mutant 
phenotypes can be broadly defined at die morpholo- 
gical, biochemical or physiological levels and provide 
information on the interaction between different pro- 
cesses. Classically, chemical and physical mutagens 
have provided loss of function recessive mutants and 
have helped define genes involved in specific path- 
ways or processes. To study the interaction between 
genes, mutants in a particular pathway can be com- 
bined and the genetic hierarchy studied. The genetic 
circuits of simple processes like pigmentation have 
been defined in a number of plants (Mol et aL, 1998) 
while others like the ethylene response pathway (So- 
lano & Ecker, 1998) are best studied in models like 
Arabidopsis. 

The genetic analysis of mutants requires good 
plant systems that are easy to handle and are genetic- 
ally simple with minimal gene redundancy. Arabidop- 
sis, maize, tomato, petunia and barley have served 
as good systems to isolate mutants using chemical or 
physical agents and map the corresponding loci on 
chromosomes. These mutants can then be precisely 
mapped and lead to gene isolation by map-based clon- 
ing procedures from large YACYBAC libraries. Contigs 
or physical maps of; YAC/BAC clones are available 
for Arabidopsis (Schmidt et al„ 1995) and this fa- 
cilitates tbe:Kle|itifiCation of the corresponding gene. 
Once the gerieifhas been identified, complementa- 



tion of the mutant by transformation (Arondel et ah, 
1992) is the most straightforward way of establish- 
ing the gene-phenotype causal relationship. The proof 
of isolation of the gene corresponding to the mutant 
can also be demonstrated by sequencing a number 
of alleles (Shirasu et al., 1999) in cases where com- 
plementation of the mutant by transformation is not 
easy. 

Two features determine the ease of gene identifica- 
tion by map based cloning approaches: the availability 
of linked markers and transformation protocols. Mark- 
ers can be generated at high efficiency in any plant 
by methods like AFLP using bulked segregation ana- 
lysis. In Arabidopsis the generation of SNPs and 
HTP screens in segregating populations (Cho et al., 
1999) can aid in determining the precise map po- 
sition within a few weeks. Together with efficient 
transformation systems like the floral dip method in 
Arabidopsis (Clough & Bent, 1998) gene-function re- 
lationships with mutants can be established in a semi 
HTP manner. Consequently, with the help of the gen- 
ome sequence, the genes corresponding to all the 
known mutants can be identified in a cost-effective 
way. 

Functional genomics with heterologous insertion 

Knockout mutagenesis 

Insertional mutagenesis for gene identification of- 
fers several advantages for functional genomics over 
mutants derived from chemical/physical treatments. 
Two types of insertion sequences are commonly used 
in plants: transposable elements and Agrobacterium 
tumefaciens mediated T-DNA (transfer DNA) inser- 
tions. The insertion sequence often causes a knockout 
mutation (Figure IB) by blocking expression of the 
gene and might display a mutant phenotype. The 
mutant gene that is tagged by the insertion sequence 
can then be isolated by recovering DNA flanking the 
insert and subsequently lead to the isolation of the 
wild-type gene sequence. This method of gene tag- 
ging with well-characterized insertion sequences as 
molecular tags does not need prior knowledge of the 
gene product or expression. The advent of efficient 
insertional mutagenesis strategies, first with the endo- 
genous transposons in maize, later with T-DNA in the 
model Arabidopsis and heterologous transposons in a 
number of good genetic systems has rapidly facilitated 
gene identification and isolation (Pereira, 1998). 
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al., insertion The identification of specific mutants is the 

sh- ^ . primary rate-limiting step but consequently leads to 

oof E p gnr^ft 1 r "IftSHj. / the identification of the gene. The most disturbing fea- 

ant — ^ ^ — j^j ' "—^ — : — ture of an insertional mutagenesis strategy is that the 

ber process of transformation often causes 'somaclonaT 

>m- W^f&W mutations that are not tagged by the T-DNA inser- 

not 8 / tion. This frequency of non-tagged mutants is quite 

ll^K/ te;:-.. ^ high in tissue culture transformation systems as was 

ca- observed for tomato (Yoder, 1990) and Arabidopsis 

lity WHWmW (Koncz et al., 1992). The major advance in the use 

irk- C of T-DNA tagging was made in Arabidopsis, where 

ant p\jf U |j :1> non-tissue culture transformation approaches like seed 

na- transformation (Feldmann, 1991), vacuum infiltration 

and ' 'mBB&^ y (Bechtold & Pelletier, 1998) or floral-dip (Clough & 

al., jy ^^mW Bent, 1998) methods displayed a much lower propor- 

po- ^ pv^f I i ^ tion of untagged mutants. In any case, these forward 

ent — genetics strategies for recovering T-DNA insertion 

I in ^ mutants require careful genetic analysis and proof 

re- ^B!mW by complementation after transformation. For for- 

;mi ^ . A / . ward genetics, T-DNA insertional mutagenesis is only 

en- ^ — : fl Vll^ ^ practical on a large scale for Arabidopsis and has res- 

the ulted in a large number of tagged genes. Molecular 

live ^piip^K. analyses of tagged mutants (Azpiroz-Leehan & Feld- 

\ //^ mann, 1997) have shown that insertions can occur all 

\/ ±> W> - -1 D^i {> over a gene, even insertions in introns and promoters 

revealing mutant phenotypes. Moreover, the tagged 

II genes are distributed randomly over all chromosomal 
G \^*" : " : -"^^ ^ regions. For reverse genetics strategies, many T-DNA 

\y ► Ijir&fl ffij^ffi^P' insertion populations have been developed in Ara- 
bidopsis (See Table 1), but are equally applicable in 

0 f. ■•^Jft y °* er ^ e ^ ce efficient transformation pro- 

, ver II V' cedures. Another knockout mutagenesis tool in rice is 

nts. \ / Ma^ l F 7 ^ ■ ^ ' ^f s, 011 Oogenous retrotransposon Tosl7 that has been 

se< j ^ developed for reverse genetics screens (Hirochika, 

; Figure /. Insertion sequences for gene function identification. 1997) 

um Loss-of-function mutations are possible with insertion types B-E, ' , , „ 

*er- Gain-of-fanction mutations with types G & H, Gene expression Transposon tagging has been an effective strategy 

ou t detection with types C-F. (A) Wild-type gene structure show- to identify genes in plants like maize with its well- 

the ^8 different components with symbols: E-enhancer, P-promoter, studied endogenous transposons (Walbot, 1992). The 

Ascription start, thick arrow- translation start ATG, exon 1 maize systems Ac _ Ds (Baker et ^ 19g6) 

l,,c & 2 of standard gene as shaded box with intervening intron, pA- , _ r v ... . D . . trtork f ' 

nee transcription termination site, insertion- T-DNA or transposon as ^ d En-l(Spm) (Pereira & Saedler, 1989; Masson & 

the triangle insert in gene. Hun arrow indicates gene transcription; thick Fedoroff, 1989), have been shown tO transpose after 

t j ie a" 10 * indicates reporter gene within insertion sequence. (B) Knock- being introduced into numerous heterologous hosts. 

as- T^fT ^ thin -l oding l g ™ bl ^ g ^f™- These heterologous transposons can be modified in 

a s (C) Promoter trap with insertion-reporter under transcriptional con- & r 

as trol of tagged gene. (D) Gene trap with insertion reporter lacking vitw and ^us offer several advantages for transposon 
the translation start, as translation fusion with tagged gene. (E) Exon mutagenesis over the endogenous systems (Pereira, 
trap (gene trap) with insertion reporter bearing a splice acceptor 1998). To develop effective tagging strategies, two 
SA to make gene fusions with tagged gene. (F) Enhancer detection * . „ _ • • ... 
do- with Insertion Reporter displaying flaiLg gene enhancer activity. component systems, comprising a mobile transpo- 
se (G) Activation tag with insertion carrying a strong enhancer that son component (Ds or I/dSpm) and the corresponding 
n a activates the expression of adjacent genes. (H) Promoter read-out stable transposase (Ac or En/Spm) source, have been 
. insertion with a strong promoter reading out into adjacent gene , subsequently used (Bancroft et al, 1992; AartS et al., 
causing mis-expression. ™ , 

1993). The mobile transposon components are engm- 
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Table 7. Insertion sequence tags used for reverse genetics in Arabidopsis 
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DsG/DsE 


NPTII 
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2,000 lines 




X 


Martienssen, 1998 


En-1 




-6 


8,000 lines 


X 




Wismanetal., 1998 


I/dSpm 




-20 


2,592 lines 


X 


X 


Speulman et al., 1999 


dSpm 


BAR 


1-low 


48,000 lines 


X 


X 


Tissieretal., 1999 


DsG 


NPTII 


1-low 


931 lines 




X 


Parinov et ah, 1999 



PT - Promoter trap; ET - Enhancer trap; GT - Gene trap. 



eered to carry various marker or selectable genes like 
antibiotic or herbicide resistance genes. In order to 
monitor or select their excision transposons are of- 
ten inserted in other assayable/selectable marker genes 
(Baker et al., 1987; Jones et al., 1989). In attempts 
to control transposition, the transposase is often put 
under the regulation of heterologous promoters (Swin- 
burne et al M 1992), or segregated out in progeny to 
yield stable transposon inserts. The first few heterolog- 
ous tagged mutants in Arabidopsis (Aarts et al., 1993; 
Bancroft et al., 1993; Long et al., 1993) and petunia 
(Chuck et al., 1 993) were obtained by a random tag- 
ging strategy, in which plants containing transposed 
elements were selfed and screened for obvious mutant 
phenotypes. The frequency of random inserts display- 
ing a mutant phenotype is about 1-5%, depending on 
the screening strategy used, similar to that observed 
for T-DNA inserts. 

As transposons move preferentially to closely 
linked sites (Jones et al., 1990; Bancroft & Dean, 

1993) , this feature is useful to efficiently isolate 
mutants for genes located near the original position. 
In heterologous systems this is achieved by mapping a 
large number of transformed T-DNA inserts that serve 
as jumping pads for local mutagenesis. This strategy 
was effectively demonstrated in tomato where 'tar- 
geted tagging' with a Ds transposon linked about 3 cM 
away from the target gene yielded tagged mutants at a 
high rate of about 1/1 ,000 transpositions (Jones et al., 

1994) . Similar directed tagging strategies have since 
been used to tag genes in some of the larger plant gen- 



omes and have made this strategy a practical reality for 
gene isolation. 

Gene detector insertions 

Most established genetic descriptions of biological 
processes come from loss-of-function or knockout 
mutations. Yet, in most higher organisms the ma- 
jority of genes display no obvious knockout phen- 
otype (Burns et al., 1994). Part of the reason for 
this 'phenotype gap' could be functional redundancy, 
where one or more other loci (mostly homologous) 
can substitute for the same function. In fact even in 
Arabidopsis with its simple genome large chromo- 
somal segments are duplicated (Lin et al., 1999; Mayer 
et al., 1999) comprising redundant genes. Sequential 
disruption of homologous and redundant genes in an 
individual genotype might ultimately reveal a mutant 
phenotype. Some subtle mutants might require a thor- 
ough screening system for their detection, while others 
have conditional phenotypes and are revealed only 
when challenged with appropriate environmental cues 
like pathogens or abiotic stress. Quite often a mutant 
phenotype reflects only a part of the gene function, 
primarily the first expressed defect, for example, in 
embryo lethal mutants that may have wide-ranging 
functions from development to metabolism. Even in 
the simple model Arabidopsis the frequency of re- 
covered lethal mutants is low when compared to other 
organisms (Miklos & Rubin, 1996), suggesting that 
a large proportion of gametophytic knockout mutants 
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might be difficult to recover. All these reasons suggest 
that other gene detection systems rather than standard 
knockouts are required for the identification of gene 
function. 

The principle that expression patterns arc use- 
ful parameters to assign gene function is extensively 
used in the application of expression microarrays and 
related technology. However, microarray based tech- 
nology does not have the resolution to monitor the 
expression of specific genes within single or few cells. 
Entrapment strategies (Skames, 1990), outlined in 
Figure 1, make use of inserts containing reporter gene 
constructs, whose expression is dependent on tran- 
scriptional regulatory sequences of the adjacent host 
gene. In this way genes (adjacent or knockouts) can be 
identified by their expression pattern, even though they 
might not directly display an obvious mutant phen- 
otype. The identification of very specific expression 
patterns, for example, in the meristem or localized 
after pathogen induction, can suggest the possible 



for phenotype to screen for. The entrapment inserts al- 
low for the selection of inserts in specific classes of 
genes, based on their expression pattern. Subsequent 
production of double or multiple mutants in a path- 
way or between partially redundant genes, as indicated 

teal by the expression pattern, might finally reveal mutant 

out phenotypes. 

na- Enhancer detection was developed to detect enhan- 

en- cers in the genome, which are capable of orientation 

for independent transcriptional activation from a distance, 

icy, and thus identify genes based on their expression pat- 

•us) tern. Enhancer detection constructs (Figure 1 F) con- 

i in tain a reporter gene like (^-glucuronidase (GUS) with a 

no- weak or minimal promoter, for example with a TATA 

yer box situated near the border of the insert. When in- 

tial tegrated in the vicinity of an enhancer sequence in 

an the genome, the reporter gene can display the expres- 

ant sion pattern of the nearby chromosomal gene. The 

tor- inserts will be activated from a distance and independ- 

ers ent of the orientation of the reporter to the tagged 

nly plant gene, so the frequency of enhancer detector in- 

ues sertions displaying some expression pattern is as high 

ant as 50%. One disadvantage of these inserts is that it is 

on, difficult to pinpoint the exact location of the tagged 

in gene that might be located at an unpredictable dis- 

ing tance especially in large complex genomes. In a com- 

in pact or completely sequenced genome like Arabidop- 

re- sis the gene prediction and identification should be 

her easier. 

hat 'Gene trap* type inserts (Figure ID) are de- 
nts signed to create fusion transcripts with the target gene 



(Skarnes, 1990). In plants the reporter genes nptii 
and gusA are able to support gene fusions and have 
been used widely in T-DNA vectors as gene fusion 
traps (Topping & Lindsey, 1995). Promoter trap types 
(Figure 1C) consist of a promoterless reporter gene 
and are expressed when inserted downstream of the 
chromosomal gene promoter. The exon trap type (Fig- 
ure IE) is more versatile as it enables reporter gene 
fusions to be created at various locations within a 
gene. By introduction of splice acceptor sites upstream 
of the reporter gene, transcriptional fusions are cre- 
ated even for insertions in introns and thus increases 
the frequency of inserts expressing the reporter gene. 
Reporter gene fusions can help in localization of the 
gene product, for example, in the nucleus, organelle 
or membrane. 

The first T-DNA vectors were designed as pro- 
moter traps (Koncz et al., 1989) with the nptii reporter 
gene. Later the gusA reporter gene was employed for 
enhancer and promoter traps (Kertbundit et al., 1991 ; 
Topping & Lindsey, 1995). These studies revealed that 
a high proportion of inserts displayed reporter gene 
expression, with about 25% for promoter traps and 
50% for the enhancer traps. Most surprisingly the fre- 
quency of expressed inserts was similar in Arabidopsis 
and larger genomes like tobacco (Koncz et al., 1989), 
suggesting that T-DNA inserted preferentially in tran- 
scriptionally active regions. Translational gene fusion 
vectors (Figure 1 D) with the nptii gene lacking a trans- 
lation start and placed near the T-DNA border (Koncz 
et al., 1989; Babiychuk et al., 1997) yielded kana- 
mycin resistant calli/seedlings and also resulted in the 
identification of gene knockout mutants. There are 
large T-DNA collections being produced now in Ar- 
abidopsis that employ promoter trapping (Table 1 ). 
These include the vacuum infiltration derived Ver- 
sailles T-DNA collection (Bechtold & Pelletier, 1998), 
a collection of 11,370 (Campisi et al., 1999) lines 
available through stock centers, and a cell culture 
transformed population (Mathur et al., 1998) that has 
been analyzed by sequencing. Although large col- 
lections of T-DNA transformants for enhancer and 
promoter traps have been produced, analyzed, and 
scores of interesting patterns identified, it has not been 
easy to make the exact correlation between the reporter 
expression pattern and the resident gene. One reason 
could be due to the presence of spurious enhancers and 
promoters in the genome that are not associated with 
the expression of the adjacent genes but are picked 
up by these expression detectors. Another reason is 
the difficulty in determining the position of the ex- 
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pressed trapped gene, especially in enhancer traps and 
with rearranged or multiple T-DNA inserts in complex 
genomes. 

For transposons similar strategies were employed 
for the Ac-Ds transposons engineered as promoter 
and enhancer trap systems. A Ds, carrying a CaMV 
minimal promoter upstream of the gusA reporter gene 
and a hygromycin resistance marker, was mobil- 
ized by an Ac transposase with linked iaah gene 
as a negative selection marker (Fedoroff & Smith, 
1993). With these selection markers stable trans- 
posed Ds elements could be recovered in the progeny. 
A novel gene LRP1 expressed in root development 
(Smith & Fedoroff, 1995) was identified that dis- 
played no mutant phenotype, probably due to gene 
redundancy. An embryo-defective lethal mutation was 
also identified (Tsugeki et al., 1996) whose expres- 
sion could be analyzed in heterozygotes, demon- 
strating the use of enhancer entrapment for function 
identification. 

A novel method for efficient selection of stable 
transpositions has been developed for an Ac—Ds 
based gene (exon) trap and enhancer trap system 
(Sundaresan et al., 1995). From the progeny of starter 
plants carrying heterozygous Ac-Ds elements, stable 
transposed Ds elements carrying the nptii gene are 
selected using the negative selectable iaaH marker 
gene for segregating out the transposase. Independ- 
ently derived Ds inserts were screened for their GUS 
expression pattern and revealed some GUS reporter 
gene activity in about 50% enhancer trap (DsE) and 
25% gene trap (DsG) inserts (Sundaresan et al. r 1995). 
The first gene identified by gene trapping, PRO- 
LJFERA a gene expressed in megagametophyte and 
embryo development (Springer et al., 1995), exem- 
plifies the utility of the system. A number of starter 
DsG and DsE lines have since been used by different 
collaborators to produce about 10,000 stable trans- 
posed elements, many of which have been screened 
for their expression patterns (Martienssen, 1998). A 
similar strategy for selection of stable transposed 
gene trap DsG elements has been initiated in rice 
(Chin et al., 1999) using greenhouse selectable marker 
systems. 

Mis-expression mutants 

The classical mutational spectrum using chem- 
ical/physical mutagens or knockout insertions reveals 
a low frequency of mutant phenotypes probably due 
to redundant gene functions, conditional and subtle 



phenotypes. Gain-of-function mutants by the spatial or 
temporal mis-expression of individual genes provides 
an alternative way to perturb gene function or reg- 
ulatory networks. One way this can be achieved is 
by employing insertion sequences that carry a strong 
enhancer element near the border, thus activating the 
adjacent gene ectopically or just enhancing its expres- 
sion. The recovery of over/mis-expression mutants 
may be advantageous in cases where positive selection 
is possible for dominant mutations, or in processes 
or genes where simple knockout mutants reveal no 
mutant phenotype. 

This method of 'Activation tagging' has been ap- 
plied successfully in Arabidopsis (Kakimoto, 1996), 
with a T-DNA vector containing multiple strong 
CaMV enhancer sequences near the border (Fig- 
ure IG), and selecting transformants for cytokinin 
independent regeneration. A similar activation tag- 
ging screen was used for the identification of an early 
flowering mutant that turned out to be due to activa- 
tion of a gene FT (Kardailsky et al., 1999), which was 
characterized independently (Kobayashi et al., 1999) 
as a knockout mutant with a late flowering phenotype. 
The dissection of genetic pathways using second-step 
mutants, was demonstrated by the selection of a sup- 
pressor of a phytochrome phyB mutant with activation 
lagging (Neff et al., 1999). The identified phenotype 
was shown to be caused by enhanced expression of 
a gene BASl involved in Brassinosteroid levels, and 
demonstrates a link between the light response and 
hormone signaling pathways. 

An extensive description of screening of" mere than 
25.000 T-DNA activation tags has been published 
recently (Weigel et al., 2000), revealing a 1/1000 
frequency of confirmed dominant mutants. This fre- 
quency is surprisingly much lower than that observed 
for recessive knockout mutant screens, suggesting that 
there is either a bias in the activation tag insertional 
spectrum or limitations (e.g. insulators) to the inter- 
action of the introduced enhancer and host promoters. 
The dominant mutants were shown to be caused by 
presence of the enhancer tag 0.38-3.6kb from the 
overexpressed gene. These gain-of-function mutants 
provide new variants for gene function analysis. 

A transposon construct variant (Figure 1H) to isol- 
ate dominant gain-of-function alleles employed the 
CaMV 35S promoter transcribing outward from the Ds 
transposon end. The first gene tagged with this system 
was TINY (Wilson et aL, 1996), which was recovered 
as a semidominant overexpression mutant. Later an- 
other semidominant dwarf mutant was obtained that 
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.or exhibits reduced gibbercllin response due to overex- 

jes pression of 5//A a regulatory gene containing a zinc 

;g- finger region. By this system, dominant mutations 

is might be caused by insertions in the transcription unit 

•ng over/mis-expressing the gene. But it might be expected 

:he that insertions producing an anti-sense transcript also 

es- produce dominant mutants due to suppression. As the 

nts frequency of overexpression inserts is not expected to 

on be higher than knockout mutations by the same trans- 

5e S poson, both types of mutations will be obtained in a 

no screen. 

ip- 

q Site specific deletions 
ng 

ig- Specific genomic deletions can help characterize 

lin the function of individual genes and other chro- 

tg- mosomal structural elements like matrix associated 

:ly regions or general enhancers in the genome. In 

'a- the light of the finding of genetic redundancy due 

r as to gene family clusters on the sequenced chromo- 

►9) somes (Lin et al., 1999; Mayer et al., 1999), the 

>e. ability to create small chromosomal deletions en- 

ep compassing a set of the duplicated genes can con- 

p- tribute to gene function analysis of these multigene 

on families. 

pe Homologous recombination has not been very 

of efficient in plants to create site specific mutations 

nd and deletions. Heterologous site specific recom- 

nd bination systems like the bacteriophage PI Cre- 
lox and the yeast FLP-FRT systems have been 

an shown to work in plants to create deletions, and 

ed in combination with transposition can be used for 

30 gene identification (van Haaren & Ow, 1993). Re- 

e- combination occurs between two lox sites or FRT 

sd sites, mediated respectively by the Cre- or FLP- 

iat recombi nase . 

ial Constructs have been used (Osborne et al., 1995) 

;r- that contain the lox recombination sites, both within 

rs. the transposon (Ds~lox) and the adjacent T-DNA. 

yy After transposition of Ds-lox to a closely linked site, 

ie the sequence specific recombination sites are situated 

its close together on the chromosome, so that recom- 
bi nase mediated small deletions can be recovered (by 

►1- Cre-recombinase line crosses). Analysis of deletions 

ie has suggested that the transposed Ds elements were 

)s not distributed randomly but in clusters. The poten- 

m tial site specificity of insertions and the non-recovery 

2d of large deletions (Stuurman et al., 1998) may thus 

n- bias the generation of random deletions all over the 

at genome. 



Reverse genetics strategies 

Site selected insertional mutagenesis 

Reverse genetics strategies in yeast can use high fre- 
quencies of gene replacement to inactivate genes or 
putative ORFs, an option not available for plants. In 
Drosophila and Caenorhabditis, a reverse genetics 
technique called 'site selected insertion mutagenesis' 
was developed using transposons (Ballinger & Ben- 
zer, 1989; Kaiser & Goodwin, 1990, Zwaal et al., 
1993) to inactivate genes that had been sequenced, 
but whose function was unknown. In this strategy 
insertion mutants of the entire genome are first gener- 
ated and then individuals with an insertion in a gene 
of interest identified by PCR techniques. In maize, 
snapdragon and petunia (Walbot, 1992; Coen et al., 
1989; Gerats et al., 1990) the multiple endogenous 
-ansposon copies provide genome saturation and site 
selected insertional mutagenesis has been shown to 
work to identify knockout mutations in specific genes 
(Das & Martienssen, 1995; Koes et al., 1995). As an 
alternative to insertions, deletion mutants generated 
by chemical or physical mutagens can be employed 
in a HTP screen to identify mutants of specific genes 
described by the genome sequence (Liu et al., 1999). 

The chance of recovering an insert in a target gene 
is dependent on the population of inserts, the size of 
the genome and the size of the gene target. Using 
transgenic systems large populations of inserts can be 
generated by T-DNA or transposon inserts. These can 
be knockout insertions ?.s well as the- variety of gene 
detection and mis-expression inserts described above 
that will help gene function analysis. Currently two 
strategies for genome saturation with inserts are being 
pursued: 

(i) single stable elements, similar to the T-DNA pop- 
ulation structure, and 

(ii) multiple elements from active transposing pop- 
ulations, similar to the endogenous transposon 
systems. 

In Arabidopsis about 110,000 inserts would give an 
insert every kb, with about a 99% chance to mutate an 
Arabidopsis gene of 5kb (Krysan et al., 1999). Lar- 
ger genomes being investigated like rice, tomato and 
maize would require sizeable larger number of inserts 
unless the frequency of insertions in genes is higher. 

The selection of defined gene mutations is done by 
PCR using a pair 6f primers, one of which anneals 
to the insertion sequence and the other to the specific 
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target gene. If an insert close to the primer is present in 
the population then a PCR product specific for the tar- 
get gene can be observed. These sensitive PCR screens 
can be scaled up to HTP so that by specific pooling 
strategies the individuals containing the insert can be 
directly identified from a large population. Progeny 
of the individual containing the insert can then be 
analyzed to characterize the mutant phenotype. 

In Arabidopsis at first a T-DNA population of 5300 
transformants with about 8000 inserts, was screened in 
53 pools of 100 transformants each, to identify inser- 
tions in two members of the actin multi-gene family 
(McKinney et a)., 1995). Later, a hierarchical pool- 
ing strategy with DNA from 9100 transformed lines 
was used to recover inserts in 17 out of the 63 sig- 
nal transduction genes tested (Krysan et al., 1996). 
These reverse genetics screening techniques have been 
used to isolate and characterize mutants, providing 
new insights into gene function and validating th** 
strategy (Hirsch et al., 1998; Gaymard et al., 1998). 
At present a collection of 60,480 transformants is 
publicly available for screening by PCR through the 
Arabidopsis Knockout Facility at the University of 
Wisconsin (Krysan et al., 1999). The pooling strategy 
and organized screening done on available DNA in- 
volves a primary PCR screen on 30 DNA pools and 
after identification of a positive primary pool, a sec- 
ondary PCR screen. This identifies a pool that has to 
be sown out by the experimenter, DNA isolated and 
the individual containing the insert identified. 

There are various transposon populations of the 
En- 1 (Spm-dSpm) system available at present in Ara- 
bidopsis* outlined in Table 1 , that are also accessible to 
PCR screening by similar methods as described for the 
T-DNA pools (Wisman et al., 1998; Speulman et al., 
1999; Tissier et al M 1999). Two of the populations 
contain multiple actively transposing elements per line 
that have been accumulated by propagation of the lines 
for a number of generations (Wisman et al M 1998; 
Speulman et al., 1999). A large stable dSpm insert 
population has been recovered by an ingenious system 
(Tissier et al., 1999), using herbicide selectable mark- 
ers in the greenhouse for positive {bar gene conferring 
Basta resistance) and negative selection using the SU1 
gene (O'Keefe et al., 1994). 

In contrast to the often-complex nature of T-DNA 
insertions, transposon inserts have a defined terminal 
sequence rendering them all accessible to PCR se- 
lection. In addition, due to their ability to jump to 
adjacent sites, transposon insert derived mutations can 
also be generated for members of closely linked gene 



families. In fact, the local transposition behavior of 
the Ac-Ds transposons has been applied to saturate 
specific genomic regions with inserts (Dubois et al., 
1998; Ito et al., 1999). Thus mapped transposons in the 
genome are valuable as donor sites to produce satur- 
ated populations of inserts in the linked chromosomal 
regions. 

The reverse genetics strategies using PCR screens 
outlined above can be applied for insertion popula- 
tions that are large enough to be able to recover inserts 
in specific genes of interest. Alternatively, the DNA 
flanking an insert can be isolated, sequenced and com- 
pared to the genome sequence. There is a variety of 
PCR based methods to isolate genomic DNA flank- 
ing inserts (reviewed in Maes et al., 1999) that can 
then be sequenced. After sequencing a large number of 
these insertion sites, inserts in genes of interest, can be 
identified and their mutant phenotype analyzed. With 
more than 50% of the Arabidopsis genome sequence 
available, a number of groups have sequenced DNA 
flanking transposon inserts and been able to identify 
tagged genes very efficiently (Ito el al., 1999; Parinov 
et al., 1999; Speulman et al., 1999; Tissier et al., 
1999). These efforts are also very valuable as an un- 
biased system to test insertional specificity and also 
for the characterization of inserts in or flanking genes 
that have not yet been predicted by bioinformatics, but 
are an accumulating resource towards insertions in all 
genes. The value of sequencing transposon-flanking 
DNA has also been recognized in the larger genome 
rice where active Ac transposon populations have been 
used for the identification of inserts in ^enss b v se- 
quencing in parallel with PCR based screening (Enoki 
etal., 1999). 

Gene silencing 

Homology dependent gene silencing (HDGS) us- 
ing transgenes offers a new tool to shut down the 
function of endogenous genes, consequently generat- 
ing mutants without mutation. Two types of HDGS 
have been described (Kooter et al., 1999); tran- 
scriptional gene silencing (TGS) involving promoter 
homology/methylation and post-transcriptional gene 
silencing (PTGS) requiring homology between inter- 
acting genes in the transcribed regions. At first PTGS 
was found associated with tandem/inverted repeats 
or methylated and minimally expressed transgenes. 
To obtain gene silencing most experiments typically 
used a complete or partial cDNA clone placed down- 
stream of a strong promoter in sense or anti- sense 
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ir of orientation. In about 10-50% of the transformants a defense in plants and that viruses produce suppressors 

irate dominant mutant phenotype was observed for genes of this antiviral defense, offer clues to search for virus 

: al., with a visual effect, for example, developmental reg- vectors suitable for VIGS in Arabidopsis or larger 

i the ulatory genes, or enzymes in anthocyanin coloration monocot genomes. The ability to use VIGS to obtain 

Hur- (Meyer & Saedler, 1996). In some cases even weak loss-of-function phenotypes, including for multigene 

>mal promoters have been used to silence genes. More re- families, will help close the phenotype gap. 

cently PTGS has been suggested to be probably due 

-ens to the involvement of a form of aberrant or double Modified genes 

, u la- stranded RNA. Constructs containing repeals that spe- 

jerts cifically confer aberrant RNAs have been shown to Mutant proteins that disrupt the activity of the wild- 

>NA be particularly useful for silencing (Hamilton et al., type gene can confer dominant negative mutant pheno- 

om- 1998; Waterhouse et al., 1998). By the transforma- types (Herskowitz, 1987). These may be by mutations 

y of tion of specific gene constructs that confer aberrant in a regulatory protein that change it from an activ- 

ank- or double stranded RNA quite predictable silencing of ator to a repressor, or mutations in the DNA binding 

can homologous genes in the genome is likely. domain of multimeric proteins so that non-functiona] 

-rof This approach of HDGS is also applicable to create mixed aggregates would compete the wild-type func- 

n be mutations for multigene families, where clues to gene tion. A number of dominant regulatory mutants have 

Vith function may be obtained, although the interpretation been isolated in plants like CJ-I in maize inhibiting 

ince of a phenotype might be complicated by suppression anthocyanin (Paz-Ares et al., 1990) and ETR1 (Chang 

>NA of the other homologous members. Suppression of et al., 1993) in Arabidopsis showing insensitivity to 

itify gene activity in metabolic processes has been demon- ethylene. 

inov strated in a number of cases and forms the basis for A transgenic approach to manipulate and intro- 

al., metabolic engineering that is applicable in transgenic duce dominant mutations was demonstrated for the 

un- crops (Herbers & Sonnewald, 1996; Ohlrogge, 1999). Arabidopsis MADS box gene AG, where specific pro- 

also As a general HTP strategy for gene-function discov- tein domains were overexpressed (Mizukami et al., 

-nes ery it is too time consuming as several independent 1996) by the 35S promoter displaying novel pheno- 

( but transformants need to be generated for every gene and types, and providing an insight into protein regulatory 

i all have to be tested individually. Suppression of csscn- mechanisms in flower development. Such mutant reg- 

Icing tial genes that give a dominant lethal phenotype also ulatory forms have also been shown to work when 

D me cannot be easily recovered and will escape attention. introduced into heterologous species (Mandel et al., 

)e en A technology for silencing that lends itself to HTP 1992). By expression of an activated form of the LFY 

se- methods is virus-induced gene silencing (VIGS). This protein (LFY: VP 16) the downstream controlled floral 

ioki employs virus vectors carrying fragments from the ex- homeotic genes could be elucidated (Busch et al., 

ons of plant host genes, that can be introduced by 1999). The induction of transcription factors (Bruce 

transformation or viral infection into the host and can et al., 2000) by estradiol-inducible systems helped 

result in the suppression of endogenous gene expres- uncover coordinately regulated genes and regulatory 

sion (Baulcombe, 1999). These suppressed pheno- pathways. These examples reveal how gene/protein 

us _ types are phenocopies known mutations in host genes. engineering can be useful to uncover functions. With 

^ The mechanism is based on an RNA-mediated defense our developing understanding of protein function, 

;rat . against viruses and is also probably involved in PTGS modification and interaction, methods of protein en- 

>G S achieved by transformation methods. At present VIGS gineering will be increasingly used to make and test 

ran . involves the cloning of gene fragments of interest in hybrid, shuffled and new proteins. 

oter vectors developed from the genomes of TMV, PVX or 

, ene TGMV for applications in Nicotiana species. These 

lter . systems can be stepped up for a HTP based forward An integrated view of functional genomics 

^jS genetics screen using libraries of gene fragments (e.g. 

eats cDNA > that are used for P lant inaction with pools or Classical genetics has tended to define mutants with 

nes. individual clones. Even members of multigene famil- distinct differences that are limited to a description of a 

ally ies that have sufficient DNA homology in fragments handful of genes. Mutants that displayed weak and in- 

wn- used wil1 be effective in gene suppression. The re- consistent effects were only poorly incorporated in the 

;nse cognition that VIGS and PTGS are related to antiviral genetic models of pathways. In some genetic models, 
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quantitative trait loci have been recognized but have 
not yet been described by gene actions. Now, reverse 
genetics strategies, with the help of semiquantitative 
measures of gene action, can offer new approaches to 
judge the effect of mutations. Mutants of redundant 
genes can also be combined to obtain multiple mutant 
phenotypes that reveal novel gene functions. 

Mutants or expression variants of genes obtained 
by any of the various techniques described here, 
can be tested for changes in global gene expression 
monitored by the HTP methods for transcriptome, 
proteome or metabolome analysis. Regulatory-gene 
expression studies can help discover coordinately reg- 
ulated pathways (Bruce et aL, 2000) and using 'guilt- 
by-association* principles can uncover relationships 
between pathways. All these studies will contribute to 
a global genomic view of biology with interconnec- 
ted genetic circuits, protein interactions and metabolic 
fluxes. 

The phenotype of plants has been emphasized here 
primarily because this attribute is tangible and also 
applicable to trait development. Phenotype is beyond 
just the morphology of a mutant, but can also be de- 
scribed by gene expression manifested at the protein 
or metabolite level. The phenotype in its broadest con- 
text includes the effect of the expression of a gene, 
monitored by knockout or mis-expression, and its ef- 
fect on the expression of other transcripts, proteins and 
metabolites. 

Functional genomics is thus expanding the scope 
of biological investigation from a one-gene approach 
to a more systems-based holistic approach , The Phe- 
nome is the physical totality of all traits of an organism 
or of one of its subsystems (Manner & Kary, 1997). 
This can now be described in our emerging integ- 
rated view of the genome, transcriptome, proteome 
and metabolome of an organism. 



Genomic perspective 

Worldwide about 40 million hectares of transgenic 
plants were estimated to be grown last year (Ohlrogge, 
1999) and this might triple in the next five years. 
Over 90% of the commercial transgenic crops were 
engineered for either herbicide or pest tolerance and 
represent the first wave of crop engineering. These en- 
gineered traits provide lower production costs and/or 
higher yields. The emerging second wave of crop en- 
gineering will provide new or improved higher-value 
products for new markets and will include plant genes 



with specific functions of commercial use. These 
genes are expected to be identified by genomic-based 
strategies. 

In agricultural production a series of traits of im- 
portance in valuable crops like cereals, sugarcane, soy- 
beans, cotton, potato, woody trees, horticultural and 
ornamental crops etc, can be addressed in a generic 
perspective using gene function discovery from model 
plants. The gene pool of crop plants has diverged over 
150 million years and the diversity has arisen from 
variations in the basic building blocks of genes de- 
termining specific plant processes. The isolation of 
genes determining plant height from Arabidopsis led 
to the identification of orthologous (similar and same 
function) dwarf genes in rice and other cereals (Peng 
et al., 1999). Most surprisingly these are the dwarf 
genes that steered in the Green Revolution, that have 
been identified and incorporated by classic breeding 
programs. 

Many genes for important traits in metabolic en- 
gineering are conserved and transferable between 
model plants and crops. Some regulatory gene func- 
tions are also conserved, as demonstrated with LEAFY 
from Arabidopsis that can function in plants as di- 
verse as aspen trees and rice for manipulation of early 
flower initiation (Weigel & Niisson, 1995). Although 
the fine-tuning of such trait modification remains to be 
done, the first steps come from an understanding of 
the basic biology in the model plant. The next step is 
to analyze such a trait on a whole genome analysis, 
by identifying the interaction with other genes in the 
pathway. The identification of a few key genes from 
Arabidopsis for generic traits like resistance to the abi- 
otic stresses cold, salt and drought can also lead to 
their application in diverse crop plants that are difficult 
to study because of their complex genomes and growth 
habits. 

The generation of HTP genome and EST sequence 
information requires large capital investments that 
have been made by a number of private sector enter- 
prises with expectations of large returns in the future. 
In the light of the genomics revolution the emphasis is 
changing from Chemicals, Pharmaceuticals and Ag- 
ribusiness towards biotechnology and the life sciences 
industry. With all the top life science players involved 
in mergers, acquisitions and alliances all vying for 
life science products, including plant biotechnology, 
a lot of intellectual property generated by patenting 
genomic-based information is gathering momentum. 
The race to own information, protect and use it is on. 
This large-scale creation and protection of knowledge 
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is considered to be the central factor for improving 
world health and living standards. 

Genomics can offer us the new Green Revolution 
to solve the world's hunger problem. To achieve this 
the race in patenting and using genome information 
will have to give way to a sustainable strategic utiliz- 
ation in the developing world. Meanwhile, systematic 
plant genome function analysis will provide us with 
information that will revolutionize our view of looking 
at plants and crop production. Most importantly, our 
understanding of the genome will make the manipula- 
tion of transgenic traits more predictable and provide 
grounds for consumer confidence. 
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